The problems of punctuation ambiguity in fully automatic text-to-speech conversion
نویسنده
چکیده
Fully automatic text-to-speech systems must accept as input any texts in whatever form they might be stored on a computer. As such, the role of punctuation characters in marking sentences, phrases and other textual constructs has to be exploited to produce natural sounding synthetic speech. Some characters not in the alpha-numeric set can, however, act both as text and as punctuation in different situations. A pre-processing module has therefore been implemented which is sensitive to these different roles and attempts to use them in preparing texts for text-to-speech conversion.
منابع مشابه
Development and Evaluation of Automatic Punctuation for French and English Speech-to-Text
Automatic punctuation of speech is important to make speechto-text output more readable and to facilitate downstream language processing. This paper describes the development of an automatic punctuation system for French and English. The punctuation model uses both textual information and acoustic (prosodic) information and is based on adaptive boosting. The system is evaluated on a challenging...
متن کاملA System Description of P^4: Possible Punctuation Points Parser
We present a Natural Language Understanding (NLU) implementation that automatically inserts punctuation marks into a sequence of words to create a group of one or more syntactically correct sentences. The software, Possible Punctuation Points Parser (P^4) provides the ability for the user to input a string of words to process, performs the punctuation possibilities, and then provides several vi...
متن کاملPunctuation has a point, so use it!
It is all too common for systems processing natural language, whether for input (automatic speech recognition, text queries, dialogue etc.) or output (text-to-speech), to ignore or strip out punctuation. The effect of prosodic factors, such as intonation and pausing, on language processing remains controversial. While there is an obvious relationship between punctuation and prosody it cannot be...
متن کاملPunctuation Prediction with Transition-based Parsing
Punctuations are not available in automatic speech recognition outputs, which could create barriers to many subsequent text processing tasks. This paper proposes a novel method to predict punctuation symbols for the stream of words in transcribed speech texts. Our method jointly performs parsing and punctuation prediction by integrating a rich set of syntactic features when processing words fro...
متن کاملAutomatic Recovery of Punctuation Marks and Capitalization Information for Iberian Languages
This paper shows experimental results concerning automatic enrichment of the speech recognition output with punctuation marks and capitalization information. The two tasks are treated as two classification problems, using a maximum entropy modeling approach. The approach is language independent as reinforced by experiments performed on Portuguese and Spanish Broadcast News corpora. The discrimi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1989